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We address the properties of excitons in monolayer M 0 S 2 from a theoretical point of view, showing 
that low-energy excitonic states occur both at the Brillouin zone center and at the Brillouin-zone 
corners, that binding energies at the Brillouin-zone center deviate strongly from the (n — l/2)~ 2 
pattern of the two-dimensional hydrogenic model, and that the valley-degenerate exciton doublet 
at the Brillouin-zone center splits at finite momentum into an upper mode with non-analytic linear 
dispersion and a lower mode with quadratic dispersion. Although monolayer M 0 S 2 is a direct-gap 
semiconductor when classified by its quasiparticle band structure, it may well be an indirect gap 
material when classified by its excitation spectra. 

PACS numbers: 71.35.-y, 78.67.-n, 73.22.-f 


I. INTRODUCTION 

In monolayer form the group VI transition-metal 
dichalcogenides (TMDs) like M0S2 are an interesting 
class of semiconductors, and one that has recently re¬ 
ceived considerable attention . 1 ' 1 'lIn these materials con¬ 
duction and valence bands are both dominantly d- 
electron in character and have band extrema located 
at the triangular lattice Brillouin-zone-corners K and 
K'. Because their structure breaks inversion symme¬ 
try, coupling is allowed between real spin and valley 
pseudospirP and gives rise to valley-dependent optical 
selection rulesP® Because of relatively large carrier effec¬ 
tive masses, reduced screening, and carrier confinement 
in a single atomic layer, their electron-hole interactions 
are much stronger than in conventional semiconductors. 
Monolayer TMDs therefore host exceptionally strongly 
bound excitons and trions that have been extensively 
studied both experimentally and theoretically.^®^ 

In this article we report on a theoretical study of ex¬ 
citon energies and wave functions in M0S2 as a func¬ 
tion of momentum across the full Brillouin zone. We 
identify important aspects of 2 D-TMD exciton physics 
that are controlled by mirror, three-fold rotational, and 
time-reversal discrete symmetries. We calculate the opti¬ 
cal conductivity, which reflects the properties of excitons 
with zero center-of-mass momentum and exhibits a set of 
peaks split by electron-hole binding energies as usual, but 
also by large valence band spin-orbit coupling energies. 
The spin splitting of valence band is conventionally used 
to classify absorption peaks into A and B series. The 
exciton energy pattern is distinctly different from that of 
a 2 D hydrogenic model. In particular the four A2 P states 
are lower in energy than the corresponding 2 s states, and 
not degenerate. 

Finite-momentum excitons are optically inactive, but 
can nevertheless play an important role in hot carrier 
relaxation and in valley dynamicsP^®^ We find that for 
both A and B excitons, the valley degenerate states at 
the Brilliouin-zone center split at small momentum into 
a lower mode with quadratic dispersion and an upper 


mode with non-analytic linear dispersion. This unusual 
pattern is due to valley coherence established by electron- 
hole exchange interactions. 

Low energy exciton states appear both near the 
Brillouin-zone center and near the Brillouin-zone corners. 
There are two distinct types of Brillouin-zone corner exci¬ 
tons. One type has electrons and holes in opposite valleys 
( K , K'), while the other has holes in the T valley and 
electrons in either the K or K' valley. Although mono- 
layer M0S2 is a direct-gap semiconductor as judged by its 
band structure, because of these Brillouin-zone corner ex¬ 
citon states, we propose that it may well be an indirect 
gap material when judged by its excitation spectra. 

Our paper is organized as follows. In Section II we 
describe the model we employ for quasiparticle bands 
and for electron-hole interactions, and in Section III we 
present our results. We conclude in Section IV with a 
summary and brief discussion. 


II. THEORETICAL FORMULATION 

Exciton states are obtained by solving a two-particle 
problem with attractive interactions between one conduc¬ 
tion band electron and one valence band hole. Because 
the valence and conduction band edges are dominated 
by Mo atom d-orbitals we use a five band d-orbital tight- 
binding model, detailed in Appendix |A| and illustrated in 
Fig.[T} for the quasiparticle bands of the TMD semicon¬ 
ductor ground state. Provided that the typical separa¬ 
tion between the electrons and holes in exciton states is 
at least several lattice constants we can assume that the 
electron-hole interaction strengths are dependent mainly 
on the separation between atomic sites and not on the d- 
orbital character on that site. These considerations lead 
to a Hamiltonian of the form 

H = H 0 + H U 

Hl= 2^1 V \ R-R'\ a] R v a] R' v ' a R' V ' a Ru- 
R,R' 
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FIG. 1: (Color online) Quasiparticle band structure of mono- 
layer M 0 S 2 . The solid curves were obtained using the 
Quantum ESPRESSO packag^Swith fully relativistic pseu¬ 
dopotentials under the Perdew-Burke-Ernzerhof generalized- 
gradient approximation, and a 16 x 16 x 1 fc-grid. The dashed 
curves were calculated from the tight-binding model, with 
cyan (red) representing states that are even (odd) under mir¬ 
ror operation with respect to the Mo plane. iu ,2 and ci ,2 la¬ 
bel the bands close to the valence and conduction band edges 
near the K and I\' points. The inset shows the hexagonal 
Brillouin zone (pink) associated with the triangular Bravais 
lattice of M 0 S 2 and an alternate rhombohedral primitive zone 
(black), and labels the principle high-symmetry points in re¬ 
ciprocal space. Note that the valence band maxima at T is 
only slightly lower in energy than the valence band maxima 
at K , K' 

where Hq is the Hamiltonian for independent d electrons 
and Hi describes their interactions. In Eq. (JTJ) { a R V ) 
is the electron creation (annihilation) operator for orbital 
v at Mo site R and v = (o, s) includes both orbital o and 
spin s labels. We follow recent worfpOM by us i n g an 
interaction potential of the Keldysh forrriP^ 2 ™ 

V R = A H 0 (R/r 0 ) - Y 0 (R/r 0 )], (2) 

2 er 0 

to account for the finite width of the TMD layer and 
the spatial inhomogeneity of the dielectric screening en¬ 
vironment. This interaction gives a good description of 
the nonhy drogenic Rydberg series observed in monolayer 
WS 2 12^123 In Eq. ([ 2 ]) e is an environment-dependent di¬ 
electric constant, ro is a characteristic length related to 
the width of a single TMD layer, and Hq and To are re¬ 
spectively Struve and Bessel functions of the second kind. 
Unless otherwise stated, we chose e = 2.5, which corre¬ 
sponds to M 0 S 2 lying on a SiC >2 substrate and exposed 
to air. ro depends on e and we took ro = 33.875A/e 
from Ref. [10| . The onsite interaction is regularized by 
setting Vo = UVr =11 0 with cio equal to the lattice param¬ 
eter of M 0 S 2 , and the parameter U is taken to be 1 for 
results presented below. The dependence of our results 
on the value chosen for the dimensionless parameter U, 
which accounts for screening of on-site potentials by re¬ 
mote bands, will be discussed later. 


Exciton states with center of mass momentum Q can 
be expanded in terms of one-electron/one-hole states: 

\x)q = '52' l l J Q( v , c ,k) \v,c,k,Q), (3) 

v,c,k 

where | v,c,k,Q) = b'-. - bt\G ), |G) is the neutral 

[K-f-Q )c 

semiconductor ground state, and the sums are over all 
valance (v) and conduction (c) bands, bf and bt are 

v ' w kn kn 

quasiparticle operators for band n at momentum k. The 
wave vector k + Q is understood to be reduced to the 
Brillouin-zone. The exciton center-of-mass momentum 
Q is also understood to be confined to the Brillouin-zone 
and is a good quantum number. Like the quasiparticles, 
excitons have a band structure. To characterize an ex¬ 
citon state, we define its fc-space probability distribution 
function as 

P $(k) =J2\^(v,c,k)\ 2 . ( 4 ) 

V,C 

The eigenvalue problem for the Hamiltonian matrix 
projected onto this subspace is a Bethe-Salpeter (BS) 
equation. Its solution determines the exciton energies 
E^ and wave functions. The Hamiltonian matrix 

(v, c, k, Q\H\v’, d , k', Q) 

=Svv'S cc 'd ££, (£(g+Q)c “ £ kv) ~ (D - X) vv ,(k, k , Q), 

where £r denotes the quasiparticle energy. We view the 
two-dimensional bands predicted by density-functional 
theory electronic structure calculations, illustrated in 
Fig-0 as solutions of the neutral semiconductor single¬ 
particle Dyson equation including all many-body self¬ 
energy effects except for finite lifetimes. It follows that in 
the exciton calculation we need to account only for cor¬ 
rections due to electron-hole interactions. It will, how¬ 
ever, be necessary to correct for the well-known ten¬ 
dency of density-functional-theory bands to underesti¬ 
mate semiconductor gaps. As discussed later, this con¬ 
sideration motivates shifting the calculated excitation en¬ 
ergy spectrum rigidly to match experimental optical ab¬ 
sorption spectra. 

In Eq. (J5j), D and X are respectively the direct and 
exchange two-particle matrix elements: 

Dyi'ik, k',Q) = 

Xvv’ (k,k',Q) = L V ^uI + ^)JuIm« + q) v , c> , 

where Wj is the unitary matrix which diagonalizes the 
quasiparticle Hamiltonian (see Appendix [A}, N is the 
number of unit cells in the finite system over which we 
apply periodic boundary conditions, and e lq R VR 

is the lattice Fourier transform of the interaction poten¬ 
tial. Note that V(q) = V(q + G) for any reciprocal lat¬ 
tice vector G. For the special case Q = 0, the exchange 
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FIG. 2: (Color online) (a) Energies of Type III excitons as a function of center-of-mass momentum Q. This figure is based 
on a calculation performed using a 45 x 45 fc-grid. The lines were added as a guide to the eye. Solid (dashed) lines represent 
states that are doubly(singly) degenerate. The labels of the excitons with Q = 0 are explained in the main text. Excitons 
with Q = K are labeled by Xj< an< i so on i n ascending order of energy. The left inset is a fe-space map plot of (k) (see 
Eq. (JtJ) ) for the Q = exciton in the lower-energy branch evolving from A. The right inset schematically illustrates the 
dominant electron-hole transitions which contribute to the Xk, Xk an< i Xk exciton states, (b) Binding energy Eb for A, B and 
A 2 s excitons at Q = 0 as a function of N 1 ^ 2 , where N is number of k points. 


term A' vanishes because of the orthogonality property 

cv = 0. We remark that exchange term survives 

even at Q = 0 in models in which electron-hole interac¬ 
tions depend not only on electron-hole separation, but 
also on orbital characteiP^ 1 . 

III. EXCITON BAND STRUCTURE 

Monolayer M 0 S 2 has mirror symmetrjP 9 with respect 
to the Mo plane. Quasiparticle band spinors can be clas¬ 
sified by this symmetry A4\k,n) = —im n \k,n), where 
the mirror number m n = +(—) for mirror even (odd) 
bands as shown in Fig. |T] Using mirror numbers, we can 
group exciton states into three decoupled types: (1) A 
Type I exciton is formed by promoting an electron from 
a mirror-even valance band (m, = +) to mirror-odd con¬ 
duction bands (m c = —); (2) Type II is similar to Type I 
but with (m v ,m c ) = (—,+); (3) For a Type III exciton, 
m v = m c = ±. Only Type III excitons can be optically 
bright. Exchange terms vanish in Type I and II excitons 
because their valence and conduction bands have oppo¬ 
site mirror numbers. For a Type III exciton, the two 
sectors m v = m c = + and m v = m c = — are coupled 
by exchange terms, but not by direct terms. In the fol¬ 
lowing, we restrict our discussion to Type III excitons, 
although many of the points we make apply equally well 
to Type I and Type II excitons. 

We have solved the BS equation by applying periodic 


boundary conditions that restrict A: to a regular discrete 
grid in the primitive zone illustrated in Fig. [T] Our main 
results are summarized in Fig. [2ja) , which shows the en¬ 
ergies of Type III excitons as a function of center-of-mass 
momentum Q. To test the convergence of our calcula¬ 
tions with respect to the k -space sampling density, we 
plot the binding energies of low-energy excitons as a func¬ 
tion of periodic system size in Fig. [2j)b) . We start by an¬ 
alyzing excitons at and close to the T point (Q = 0), and 
then discuss the nearly degenerate low-energy excitons 
Q = K, K'. 


M 0 S 2 has a 3-fold rotational symmetry which can 
be used to classify excitons with Q = 0: C 3 |x) = 
exp(—where the quantum number £ takes on 
the discrete values £ = —1,0,1. An exciton state |x+) 
with £ = 1 has a time reversal (TR) partner |x~) with 
the opposite £ = — 1. The combination of C 3 and TR 
symmetries guarantees that the TR pair |x_t-) an d |x~) 
are degenerate in energy. Breaking either C 3 or TR 
symmetry can lift this degeneracy^SlM]. The optical se¬ 
lection rule for circularly polarized light is related to 
C 3 symmetrjMteSl (X+LMG) = (x-b+|G) = 0, where 
j± = jx ± ijy is the current operator. It follows that 
polarization-dependent optical studies can can be used to 
infer the £ quantum number of a bright exciton. £ = 0 
excitons are optically inactive (xo|j±|G) = 0 . 
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FIG. 3: (Color online) (a) Real part of the optical conductiv¬ 
ity with (solid red curve) and without (dashed green curve) 
electron-hole interactions. For these calculations the BS equa¬ 
tion was solved on a 51 x 51 fc-grid and transitions were broad¬ 
ened by 20meV. The solid green arrow indicates the quasipar¬ 
ticle band gap. The two dashed gray arrows mark the ener¬ 
gies of the A 2v excitons. Note that we have rigidly shift the 
excitation energy spectrum by a constant, so that the A ex- 
citon energy is at 1.93eV as measured by photoluminescence 
experiments® 11 ^. (b)-(e) fc-space maps of Tq_ 0 (fc) for low- 
energy excitons. (b)The £ = 1 exciton |A+). (c)-(d) Two 
non-degenerate A 2 P excitons in valley K. (e) The £ = 1 A 2 s 
exciton. 


A. Optical Conductivity 

Exciton states at Q = 0 are most easily studied exper¬ 
imentally because they contribute to the optical conduc¬ 
tivity. Fig. [3](a) plots the real part of the in-plane optical 
conductivity which has a number of clear features. Peak 
A stems from the doubly-degenerate excitons expected 
from the symmetry analysis given above. Fig. [3])b) illus¬ 
trates Pq =0 (k ) (see Eq. (j4j)) for |A+), the £ = 1 exciton 
of peak A. |A+) is dominated by electron-hole transitions 
from valence band V\ to conduction band c 2 in valley I\ 
(see Fig. [I]), while its TR partner | AJ) is primarily com¬ 
posed of similar transitions in the opposite valley K'. 
Excitons in the B series are similar to those in A, and 
are dominated by transitions from band V 2 to C\ in valley 
K ( K '). The lowest energy A and B excitons are analo¬ 
gous to the Is states of a 2D hydrogenic model. Fig.[3])a) 
also shows 2s and 3s peaks identified in the A series, and 
a 2s peak identified in the B series. 

Our calculation predicts that the lowest energy A ex¬ 
citons have a binding energy ~ 0.3 eV, in agreement 


with the estimate in Ref. m- As shown in Fig. gb), 
a 33 x 33 fc-grid already provides good convergence for 
the lowest energy A and B excitons whereas, because 
more weakly bound excitons have sharper structure in 
momentum space as illustrated in Fig. [Ije), the A 2s ex¬ 
citon requires a finer fc-grid for convergence. 

The energies of s-wave excitons in the A(B) series de¬ 
viate strongly from the (n— 1/2)“ 2 pattern of 2D hydro¬ 
genic models. This is partly due to the effective electron- 
hole interaction potential(Eq. , which differs from the 
standard Coulomb interaction because of the finite width 
of the TMD layer. We also find that A 2p states have a 
lower energy than A 2s states. This anomalous energy or¬ 
dering is consistent with recent experimental and theoret¬ 
ical studies of monolayer WS 2 3 '. More interestingly, the 
A 2 p states do not have the 4-fold degeneracy expected in 
two-valley systems. In fact, there are two non-degenerate 
A 2p states within each valley, as illustrated in Fig. |3j)c) 
and |3])d). This feature results from the dependence of 
band state wave functions on momentum direction near 
K and K' valleys and is closely related to similar proper¬ 
ties of excitons in massive Dirac equation band models, 
which we explain in detail in Appendix^] The A- 2p states 
are not optically bright in one-photon spectra, but can be 
detected using two-ph oton techniques like those achieved 
in recent experiments! 37 * 38 ! We therefore expect that the 
energy splitting within the A 2p states can be experimen¬ 
tally measured. 


B. Valley Coherence 

Valley coherence can be externally generated using lin¬ 
early polarized lightP, and can also be intrinsically in¬ 
duced by electron-hole exchange interactions. By treat¬ 
ing finite Q terms in the BS equation(Eq. §) as a first- 
order perturbation acting on valley-degenerate excitons, 
we arrive at the effective Hamiltonian 

H f = ^° + r ° + J Q T ° (7) 

+J^[cos( 24 >^)t x +sin(2 4 >q)t v ]. 

Here ojq is the exciton energy at Q = 0, and M is the ex¬ 
citon mass, tq and T x<y are respectively identity and off- 
diagonal Pauli matrices in valley space. J^ tq originates 
from intra-valley exchange interactions, while inter-valley 
exchange interactions act as an in-plane pseudo-magnetic 
field in the valley space and are captured by the second 
line of Eq. |7j). The dependence of H°A on q the orien¬ 
tation angle of the 2D vector Q , follows from the wave- 
vector dependence of the conduction and valence band 
states near K and K'. Ref. [221 and [3U; have studied 
the inter-valley exchange interaction and show that, 

J^cx |^(0)| 2 Q 2 F c5 , (8) 

where | (0) | 2 is the probability that an electron and 

a hole overlap spatially. In the small Q limit, Vq oc 
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1/(<2(1 + roQ)) for the potential in Eq. <H- Therefore, 
J^ scales linearly with Q in the long wave-length limit. 
We note that while inter-valley exchange interaction en¬ 
dows finite-momentum excitons with chirality I = 2 as 
pointed out by Ref. [30], intra-valley exchang e ! 22 ! 35 * 40 ^ is 
also important especially in regard to the exciton energy 
dispersion. 

Equation |7]) is derived from the massive Dirac model 
approximation to the quasiparticle band structure near 
K and K’ ! 22 l 3 °l Our lattice calculation verifies this low- 
energy effective theory. Fig. H a ) shows that there are 
two non-degenerate energy branches which evolve from 
the double-degenerate Q = 0 A, B and A 2s excitons. In 
each energy branch, the exciton state is a coherent super¬ 
position of direct excitons at the two valleys, as demon¬ 
strated in the left inset of Fig. Ha). The lower and upper 
energy branch have respectively quadratic and linear dis¬ 
persion in the long wave-length limit, in agreement with 
the prediction of Eq. 0. Unlike their s-wave cousins, 
branches evolving from valley-degenerate Q = 0 A 2p ex¬ 
citons remain doubly degenerate at finite momentum be¬ 
cause |^/) e / l (0)| 2 is zero for p -wave excitons and exchange 
interactions therefore vanish. 

Photons with the energy of an A or B exciton can at 
most provide a momentum with magnitude ~ uia(b)/c, 
where c is the speed of light. According to our calcu¬ 
lation, this momentum corresponds to an energy split¬ 
ting of 0.4(0.5) meV between the two energy branches 
evolving from A(B ), and a period of ll(8)ps for Rabi 
oscillation between the two valleys. We conclude that 
interaction-induced valley coherence provides an impor¬ 
tant mechanism for valley depolarizatiori-IK^U, j n addi¬ 
tion to that provided by impurity scattering. 


C. Brillouin-zone corner Excitions 

Large-momentum excitons composed of electrons and 
holes in opposite valleys can have an energy similar to 
those with zero momentum. In Fig. Ha), the Xk and Xk 
excitons have center-of-mass momentum Q = K when 
reduced to the first Brillouin-zone. They are dominated 
respectively by transitions between valence band holes of 
V\ and v 2 states in valley K and conduction band elec¬ 
trons of ci and c 2 states in the opposite valley I\'. The 
Xk and Xk excitons are the Q = K counterparts of the 
A and B excitons and have nearly the same energy. The 
small differences have two origins, the energy splitting be¬ 
tween ci and c 2 bandJ®, and a change in the exchange 
interaction. 

However another set of excitations appear at the same 
crystal momentum. Low energy excitations at Q = K 
also originate from holes in the T valley and electrons in 
the K valley, as illustrated in the right inset of Fig. H a )- 
In our lattice calculations Xk an d Xk are suc h excitons. 
If we neglect the spin splitting of both topmost valence 
bands at T and the lowest conduction bands at the I\ 
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FIG. 4: The energy difference between the large-momentum 
exciton an d zero-momentum exciton A as a function of 
dielectric constant e. The calculation was performed on a 
45 x 45 fc-grid. Xx is the triplet exciton state with a hole in 
F valley and an electron in K valley. 


valley, the corresponding excitons can be classified as sin¬ 
glets and triplets according to their spin configurations. 
Singlets experience strong electron-hole exchange inter¬ 
actions, because the valence band maximum at T and 
the conduction band minimum at K have the same d z 2 
orbital character. The exchange energy is proportional 
to Vq =k = {U — 1.28)Vr =( 1o . The exchange interactions 
effectively vanishes for triplets because of their particular 
spin structure. In our calculation, U is set to be 1 and the 
exchange energy for singlets is therefore negative. There¬ 
fore, we can identify Xk as the singlet state and x’k as 
one of the triplet states. Depending on the value of U 
used in our model, the lowest energy excitons can occur 
at Q = 0, corresponding to a direct gap system, or at 
Q = K, K' corresponding to an indirect gap system. Be¬ 
cause the appropriate value which should be used for U 
depends on electronic correlations at the atomic level and 
on screening by remote bands not included in our calcu¬ 
lation, we are not able to reach a definitive conclusion 
as to whether or not the singlet Xk * s the lowest energy 
exciton. However, the energy of the triplet Xk does not 
suffer from such uncertainty. The valence band effective 
mass at the T point is heavier than that at the K point^. 
Electron-hole pairs are therefore bound more strongly in 
X% than in the A exciton, which compensates for the 
energy difference (68meV) between the topmost valence 
bands at T and K points, and makes the energies of the 
triplet Xk an d A excitons very close to each other. By 
decreasing the dielectric constant e to 1, we find that the 
triplet x%c becomes lower in energy than the A exciton by 
19meV, as illustrated in Fig. [I] The energetic ordering 
of direct and indirect exciton states in single-layer TMDs 
could therefore depend on the two-dimensional system’s 
three-dimensional dielectric environment. 
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IV. SUMMARY AND CONCLUSIONS 

In summary, we have constructed a lattice model based 
on Mo d-orbitals to study exciton band sturcture of 
monolayer M 0 S 2 . Zero-momentum excitons have non- 
hydrogenic energy series, because screening effect has a 
spatial dependence, and band edges in K and K' val¬ 
ley are described by massive Dirac equation. The energy 
splitting within A 2p excitons remains to be measured us¬ 
ing methods such as two-photon technique. The exciton 
band structure exhibits non-degenerate energy branches 
evolving from valley-degenerate bright excitons, indicat¬ 
ing valley coherence. Such low-energy branches are well 
separated from the continuum spectrum, whi ch jus tifies 
the application of low-energy effective theorjEHED We 
find that low-energy excitons can possess a large momen¬ 
tum, either with electron and hole in opposite valleys ( K , 
K'), or with hole in T valley and electron in K(K') valley. 
Large-momentum low-energy exciton states can provide 
relaxation channels for bright excitons, and reduce pho¬ 
toluminescence quantum efficiency. 
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Appendix A: Explicit form of tight-binding model 

We approximate the quasiparticle-Hamiltonian matrix 
7 -Lj: by a tight-binding model which generalizes Ref. [05] 
from three to five d-bands: 

H i : = \L-S + l 2 ®'Ho(k). (Al) 

The first term XL ■ S describes the on-site atomic spin- 
orbit coupling of Mo d orbitals, where L and S are re¬ 
spectively the orbital and spin angular momentum. The 
coupling constant A = 0.073eV, which was adjusted to fit 
the valence-band spin splitting at the K point as detailed 
in Ref. [43 . The second term I 2 07fo(fc) is spin indepen¬ 
dent, where / 2 is a 2 x 2 identity matrix in spin space, 
and T~Lo{k) is a 5 x 5 matrix in orbital space. Because 
of the mirror symmetry with respect to the Mo plane, 
7io(k) is block-diagonal: 

77jj ven (A) is a 3x3 matrix in the bases 
{\d z 2 ),\d xy ),\d x 2 - y 2 )}. Similarly, %g dd (fc) is a 2 x 2 
matrix in the bases {| d xz ), \d yz )}. 


For "Hg ven (fc), we adopt a model constructed in Ref. [151 
The construction uses point-group symmetries to mini¬ 
mize the number of parameters, and the parameters are 
fitted from first-principle energy bands. An explicit form 
of r HQ Ven (k) with hoppings up to third-nearest-neighbor 
is given in Eqs. (13) to (24) of Ref. [43] We generalize the 
symmetry-based method used in Ref. 1431 to construct the 
remaining part of the Hamiltonian ^§ dd (A), and its form 
is: 


R° dd (fc) 


| h x {k) h X y(k)\ 
\K y {k) hy(k)J ’ 


(A3) 


in which 

h x ( k) =Oi + 2t cos 2a + (t + 3t') cos a cos /3 
+4s cos 3a cos /3 + (3s 7 — s) cos 2/3 
+2 u cos 4a + [u + 3 u) cos 2a cos 2/3, 
h y (k) =Oi + 2t' cos 2a + [t' + 3 1) cos a cos /3 
+4s' cos 3a cos /3 + (3s — s') cos 2/3 
+2 v! cos 4a + (y! + 3 u) cos 2a cos 2/3, 
h xy (k) =4:it xy sin a(cos a — cos /3) (A4) 

+V / 3(f / — t ) sin a sin [3 
+ 2 V3(s' - s) sin a sin /3(1 + 2 cos 2a) 

+4 iu xy sin 2a(cos 2a — cos 2/3) 

+V3 ( 1 / - u) sin 2a sin 2/3, 

(«>/?) =(^k x a 0 , ^ k y a 0 ). 


Hoppings in real space up to third nearest neighbors are 
included. The numerical value of the parameters in unit 
of eV is: 


Oi =3.558, 

t = - 0.189, f' = -0.117, t xy = 0.024, 
s = -0.041, s' = 0.003, ( ’ 

u =0.165, v! = — 0.122, u xy = —0.140, 

which are obtained by fitting to first-principle calcula¬ 
tions as shown in Fig. [T| 

H £ is diagonalized by the unitary matrix (Eq. §), 
and the corresponding eigenvalues are quasiparticle en¬ 
ergies £ %n . 


Appendix B: Massive Dirac model for excitons 

In this appendix we study excitons in the massive Dirac 
model, and show that this simple model captures many 
important features of the Q = 0 excitons in monolayer 
MoS 2 . In the vicinity of K or K' point, the k ■ p Hamil¬ 
tonian for two bands (c 2 , iq) is described by the massive 
Dirac model 3 : 

H T {k) = hv F k[cos($j:)ax + sin($^)aj / ] + A a z , (Bl) 
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FIG. 5: (Color online) Binding energy Eb in massive Dirac 
model for Is, 2s, and 2 p excitons in valley I\ (r = 1) as a 
function of dielectric constant e. The parameter values are 
hv F = 1.105eV x 3.193A, A = 0.7925eV and r 0 = 33.875A/e. 
The inset table compares Eb obtained respectively from mas¬ 
sive Dirac model (MD) and lattice model (LM) for e = 2.5. 


where <J x ,y,z is the Pauli matrices for the basis function 
at K or I\' point, and A is the energy gap. The angle 
$£ is defined as cot $£ = rk x /k y , where r = ±1 labels 
valley K and K'. The conduction and valence band are 
described by spinors: 


| G kJj r 


( cos(%) 
ysin(^ L )e I<I, fc 


I V,k) T 


f sin(^)e 

V -cos(^) ) ’ 


(B2) 

where angle 9 k is defined as cos 9k = A /e k with e k = 
\/A 2 + ( hi’Fk ) 2 . 

As discussed in the main text, inter-valley coupling is 
nearly absent for Q = 0 excitons. Within each valley, the 
kernel of the BS equation in Eq. ([5]) can be expressed in 
terms of band spinors: 


FIG. 6: (Color online) Binding energy Eb in massive Dirac 
model with standard Coulomb interaction(ro = 0, F(q) = 1) 
for Is, 2s, and 2p excitons in valley K (r = 1) as a function 
of fine structure constant a. 


instead of (Eq. (B1)) so that the valley dependence of 
K r is explicit. The interaction potential has the following 
form 


K = (B4) 

where the form factor F(q ) = 1/(1 + r$q) modifies the 
Coulomb interaction in consistency with the real-space 
interaction potential (Eq. <§)• The BS equation reads: 

= Ei/j(k). (B5) 

k' 


We can define effective Bohr radius a* B , Rydberg en¬ 
ergy Ry* and fine structure constant a: 


1C T (k, k') =S^,T k - D T (k, k'), 

Tk =2e k , 

D r (k, k') =^V%_p (r(c, k\c, k') TT (v, k'\v, fc) T ) 

[(1 +cosd fc )(l + cos6fo) 
+ 2 sindfc sin 9 k 'e lT ^k , ~ ( l , A 


+ (1 — cos0fc)(l — cos6 f fc')e l2r ^ E ' , 

(B3) 

where T k can be understood as the kinetic energy. 


D T (k , k') is the electron-hole direct interaction, while the 
exchange interaction X (Eq. |5])) is neglected. A is the 
area of the 2D system, and (j>^ is the orientation angle 
of k with cot <f>t = k x /k y . In Eq. (B3), angle is used 


a 


* _ 

B ~ 


2 e(hvF) 2 
e 2 A 


,Ry* 


1 e 2 e 2 

2 ea* B ’ ehvF 


(B6) 


For notation convenience, we also define parameter p = 
(a/2) 2 . 

After taking a* B as unit of length and Ry* as unit of en¬ 
ergy, and using an ansatz ip(k) = ip(k)e llrl> z , BS equation 
is reduced to the following ID eigenvalue problem: 


Eip(k) = T k ip(k) 

/»00 

- / dk' [(1 + cos 9 k )(l + cos9k>)I k k'{l) 

Jo 

+2 sin 9 k sin 0 k > I k k' (l + r) 

+(1 - cos0 fe )(l - cos 6 k >)Ikk'(l + 2r)]fcV(fc') 


(B7) 






















where T*,, cos Ok and Ikk'(l) in effective atomic units are: 


Tfc = - \Jl + /3k 2 , cos Ok = 1/v 7 1 + /3fc 2 , 


4fc'(0 = ^- I*” 

^ Jo \/ 


k 12 — 2 kk' cos (j>) cos It/) 


\Jk^ + fc' 2 — 2fcfc' cos ^ 


(B8) 

Eq. (B7) makes it clear that within the same valley ex- 


citons with quantum number l and —l are not degenerate 
in energy, because of the wave-vector dependence of the 
band spinors. However, there is a degeneracy between 
(r, l) and (-r, -1) excitons, which originates from time 
reversal symmetry. 

We apply the massive Dirac model to excitons in A 
series of monolayer M 0 S 2 . The appropriate parameter 
values are Hvf = 1.105eV x 3.193A, A = 0.7925e\® and 
v q = 33.875A/^2l. The effective atomic units then take 
the following value, a* B = ex 2.18A and Ry* = 3.3 eV/ e 2 , 
and the fine structure constant a = 4.075/e. Eq. (B7) is 
solved numerically by discretizing the ID fc-space. The 
result is presented in Fig. [5j which depicts the binding 
energy of Is, 2s, and 2 p excitons in valley K as a func¬ 
tion of dielectric constant e. It reproduces all essential 
features of Q = 0 excitons discussed in the main text: 


(l)binding energies deviate from the pattern of the 2D 
hydrogen model; (2)2 p states have a larger binding en¬ 
ergy than 2s; and (3)there is an energy splitting between 
l = il 2p states within the same valley. Moreover, the 
binding energies calculated by using the Massive Dirac 
model and the lattice model of the main text are close to 
each other as shown in the inset table of Fig. [5] 


Finally, we study the massive Dirac model with stan¬ 
dard Coulomb interaction by taking form factor F(q) to 
be 1. In this case, the fine structure constant a controls 
the deviation of the massive Dirac model from the 2D 
hydrogen model. Fig. [6] presents the binding energy as 
a function of a. In the limit of weak electron-hole in¬ 
teraction (a —>• 0), the massive Dirac mo del reduc es t o 
the 2D hydrogen model as implied by Eq. (B7) and (B 81 . 
Therefore, Eb{ls) = 4Ry* and Eb{2s ) = Eb(2p) = ( Ry* 
as a goes to 0. 2s and 2 p states remain nearly degener¬ 
ate for a < 0.4, and develop prominent energy splitting 
at large a. The energy ordering is Eb(2p,l = —1) > 
Eb(2s) > Eb(2p,l = +1) in valley K. Note in the case 
of monolayer M 0 S 2 where interaction potential is modi¬ 
fied by F(q) (Eq. (B4|), all 2 p states have bigger binding 
energies than 2s states as shown in Fig. [5j 
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